CDS

Accession Number TCMCG078C04289
gbkey CDS
Protein Id KAG0454553.1
Location join(2994312..2994599,2994689..2994745,2995254..2995424,2995567..2995651,2996301..2996391,2996468..2996638,2998163..2998282,2998357..2998432,3011388..3011556,3011638..3012251)
Organism Vanilla planifolia
locus_tag HPP92_023845

Protein

Length 613aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000013.1
Definition hypothetical protein HPP92_023845 [Vanilla planifolia]
Locus_tag HPP92_023845

EGGNOG-MAPPER Annotation

COG_category K
Description Auxin response factors (ARFs) are transcriptional factors that bind specifically to the DNA sequence 5'-TGTCTC-3' found in the auxin-responsive promoter elements (AuxREs)
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K14486        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04075        [VIEW IN KEGG]
map04075        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGAATCGATCTGAACACAATTGAGGCGGAAGAAGAGGACACGGCGGAGCAGCGGCCGACGGTTGCGTCGGCTCGTGGGGTGTGCCTTGAGCTGTGGCACGCCTGCGCGGGGCCGCGGATTTCTCTTCCGAAAAAGGGGAGCTTGGTGGTGTACCTGCCGCAGGGACACCTGGAGCATTTCGCTGGCGCCAGAATGTCTGGCGGCGGTGCCGGAGGAGGAGGGGTGTTGATGCGTTACGATTTGCCTCCGCACTTGTTTTGCCGTGTCATCGACGTCCAGCTACGCGCGGAGGCAGCGACTGATGAGGTCTACGCCCAGCTCTCTTTGGTTGCAGAAGACGAGATTCAGGAAGTGGGCATGGACAGAGAAGGAGAAGAACTGGATGAGATGGATGGTGAGGGCAAATCCTCAATTCCTCACATGTTTTGCAAGACCCTCACTGCTTCTGACACAAGCACTCATGGTGGCTTTTCTGTTCCTCGTCGTGCGGCTGAGGATTGCTTTCCTCCTCTGGATCACAAGCAGCAACGACCTTCTCAAGAGCTTGTTGCGAAAGATTTGCATGGTGTGGAGTGGCATTTTCGACACATATACAGGGGGCAACCACGTAGGCATTTGCTCACGACAGGATGGAGTGCTTTTGTGAACAAGAAGAAGCTTGTCTCTGGGGATGCTGTACTCTTTCTTCGGGGTGATGATGGAGAGCTTAGATTGGGCATTCGAAGGGCATCTCAACTCAAGGGCAATTCTTCCTATACGATGTCTGCAACTCAAAGTACAAGCTTTGGAGTTGGGGCTTTAGCATCATTAGCCAATGCTATCTCCACAAAGAGCACATTTCAAATTAATTACAACCCCAGGGCAAGCCAATCTGAGTTCATTGTACCATATTGGAAGTTCACAAAGAGCTGCAGTTATTCAATTTCTGTTGGTACAAGATTTAAAATGCGGATTGAGACAGAAGATGCTGCAGAGAGAAGATACACAGGCTTGATAACTGCAGTGTGTGACATGGATCCTGTCCAGTGGCCTCGGTCAAAGTGGAGATGCATAGTGGTTAGGTGGGACGACGACGATGCCCTCGACAATGGAAGACAGAATAGGGTCTCTCCTTGGGAGATCGAGCCGACGGGATCCGTCTCGGGTCCCAGCACTCTTTTAGCATCAGCCCCAAAGAGAAGCCGAATCAGCATCCCCTCGGGGAATGCTGATTACCCACATACAAGTGGGAATGCCTATATGAGCTTGGGGGAACCTGCTAGGTTCCACAAGGTCTTGCAAGGTCAAGAAATTTCGGGTTACAAAGCACCTTACAAAGAGTGCGATGTTACCCTTCCTCGTGTTGCCGATACGAGAACCAATCCTTTCCTTGAAATGAGATCCGGTGGTACTCCAAGCACATGCTTGTTACCCGTTACCGGAGGTCCGACCATCGTCTCGCCGGGGTACCCAATTTTGTCTCACGAAGGCATAGGTCTTGGGGAATCGGTAAGATTCCATAAGGTCTTGCAAGGTCAAGAAATTGTTCCGGTTCTAAGGTCCTACCGGGGAATGGGAGCTGAAAGCTGCCCATTCCTTCAAGAATGCTGCACCTTCGTGCAGCCTTGCTCTTCGACGGCTCAGGTGTCGTCGTGCTCTCCAGTTCTCAGGTTTCAGCAACCGACCCCTCAGGTGACGCATCTTCGACCGATGTACGCCACGAAGGATGGCGAGAAGGTCGACCGCCCGCGCTCCGTTCCGCTCACCCGAGCCCTCCAAAGAGCAGCAACCTGGAACTGTTACATCCATTGCTCAGAAGAATGGTCAGAGCATCTTTCATGGTGGAGGGAGCAGCTGTAG
Protein:  
MGIDLNTIEAEEEDTAEQRPTVASARGVCLELWHACAGPRISLPKKGSLVVYLPQGHLEHFAGARMSGGGAGGGGVLMRYDLPPHLFCRVIDVQLRAEAATDEVYAQLSLVAEDEIQEVGMDREGEELDEMDGEGKSSIPHMFCKTLTASDTSTHGGFSVPRRAAEDCFPPLDHKQQRPSQELVAKDLHGVEWHFRHIYRGQPRRHLLTTGWSAFVNKKKLVSGDAVLFLRGDDGELRLGIRRASQLKGNSSYTMSATQSTSFGVGALASLANAISTKSTFQINYNPRASQSEFIVPYWKFTKSCSYSISVGTRFKMRIETEDAAERRYTGLITAVCDMDPVQWPRSKWRCIVVRWDDDDALDNGRQNRVSPWEIEPTGSVSGPSTLLASAPKRSRISIPSGNADYPHTSGNAYMSLGEPARFHKVLQGQEISGYKAPYKECDVTLPRVADTRTNPFLEMRSGGTPSTCLLPVTGGPTIVSPGYPILSHEGIGLGESVRFHKVLQGQEIVPVLRSYRGMGAESCPFLQECCTFVQPCSSTAQVSSCSPVLRFQQPTPQVTHLRPMYATKDGEKVDRPRSVPLTRALQRAATWNCYIHCSEEWSEHLSWWREQL